Article 5421
Title of the article |
SPEECH/PAUSE SEGMENTATION METHOD BASED ON TEAGER ENERGY OPERATOR |
Authors |
Alan K. Alimuradov, Candidate of technical sciences, director of student research and production business incubator, Penza State University (40 Krasnaya street, Penza, Russia), E-mail: alansapfir@yandex.ru |
UDC index |
004.934 |
DOI |
10.21685/2227-8486-2021-4-5 |
Abstract |
Background. Segmenting speech into voiced sections, unvoiced sections and pauses is a key task for the majority of speech applications. It is especially important in systems that assess human psycho-emotional state from speech, since the durations of voiced sections, unvoiced sections and pauses are informative parameters relevant to naturally expressed human emotions. Materials and methods. The second-order differential Teager energy operator was used, which is highly sensitive to changes in signal amplitude and frequency. The method is implemented in MATLAB (MathWorks). Results. A speech/pause segmentation method has been developed that linearly divides a speech signal into fragments, calculates the energy characteristic using the Teager energy operator, calculates the short-term energy values, and determines the "speech/pause" status of each fragment from calculated threshold values of the short-term energy. The developed method was studied to assess the effectiveness of its speech/pause segmentation against the classical method based on short-term energy analysis. Conclusions. According to the obtained results, the efficiency of speech/pause segmentation increases by 5.26 % and 5.51 % for type I and type II errors, respectively. The proposed speech/pause segmentation method can be effectively tested in systems for assessing human psycho-emotional state owing to its good sensitivity to sudden changes in signal amplitude and frequency under unstable vocal motor skills. |
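The steps named in the abstract (fragmentation, Teager energy operator, short-term energy, thresholding) can be illustrated with a minimal Python sketch. It is not the author's MATLAB implementation. It assumes the standard discrete form of the operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], non-overlapping fragments, and a hypothetical threshold set to a fixed fraction of the largest fragment energy. The abstract does not state how the thresholds are calculated, so `threshold_ratio`, `frame_len` and the function names are assumptions made for illustration only.

```python
import math


def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]^2 - x[n-1] * x[n+1]."""
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def segment_speech_pause(signal, frame_len, threshold_ratio=0.1):
    """Label each non-overlapping fragment as speech (True) or pause (False).

    threshold_ratio is a hypothetical rule (fraction of the maximum
    fragment energy); the source does not specify its threshold formula.
    """
    energies = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        psi = teager_energy(frame)
        # Short-term energy of the fragment: mean absolute Teager energy.
        energies.append(sum(abs(v) for v in psi) / len(psi))
    if not energies:
        return []
    threshold = threshold_ratio * max(energies)
    return [e > threshold for e in energies]


# Synthetic test signal: silence, a sine burst, then silence again.
omega = 2 * math.pi * 0.05
demo_signal = (
    [0.0] * 400
    + [math.sin(omega * n) for n in range(400)]
    + [0.0] * 400
)
demo_labels = segment_speech_pause(demo_signal, frame_len=100)
print(demo_labels)
```

For a pure sinusoid A*sin(omega*n) the operator returns the constant A^2 * sin^2(omega), which depends on both amplitude and frequency. That dependence is the property the abstract relies on when it contrasts the method with plain short-term energy.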
Key words |
speech signal processing, speech segmentation, voiced and unvoiced speech, Short-Time Energy, Teager Energy Operator |
For citation |
Alimuradov A.K. Speech/pause segmentation method based on Teager energy operator. Modeli, sistemy, seti v ekonomike, tekhnike, prirode i obshchestve = Models, systems, networks in economics, technology, nature and society. 2021;(4):52–63. (In Russ.). doi:10.21685/2227-8486-2021-4-5 |
Last updated: 06.04.2022 13:00